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TRAFFIC MATRIX ESTIMATION METHOD AND APPARATUS 

[0001] The present utility patent application is a non-provisional of provisional patent 
application, "FAST ACCURATE COMPUTATION OF LARGE-SCALE IP TRAFFIC 
MATRICES FROM LINK LOADS," Serial No. 60/398,474, filed on July 25, 2002, the contents 
of which are incorporated by reference herein. 

BACKGROUND OF THE INVENTION 

[0002] The present invention relates generally to network management and, more 
particularly, to estimation of traffic matrices in a network. 

[0003] A traffic matrix provides, for every ingress point i into a network iand egress 
point j out of the network, the volume of traffic T t j from i to j over a given time interval. The 

traffic matrix, together with network topology and routing and fault data, can facilitate diagnosis 
and management of network congestion — as well as provide critical inputs to network design, 
capacity planning and business planning. Unfortunately, traffic matrices are generally 
unavailable in large operational Internet Protocol ("IP") networks. Rather, typical production 
systems gather data on resource utilization at network nodes and links (e.g. link loads); end-to- 
end performance metrics for specific transactions (such as one way delay statistics for packets 
exchanged between measurement servers at the network edge); and status and configuration of 
network topology and routing. Though these may reveal traffic anomalies or congestion 
problems, they do not in general reveal potential solutions. For instance, link load measurements 
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may reveal congestion on a link, but shed little light on its cause, which in general requires 
understanding the traffic matrix. 

[0004] The inability of network operators to measure the traffic matrix is a fundamental 
obstacle to developing sounds methods for network and traffic engineering in operational IP 
networks. 

SUMMARY OF INVENTION 

[0005] The present invention is directed to mechanisms for measuring traffic volume 
from a plurality of ingress points to a plurality of egress points in a large scale network, such as 
an IP backbone network. The present invention infers the traffic matrix advantageously from 
widely available link load measurements, such as SNMP data. First, such data is collected and 
used to construct a gravity model of link to link traffic to capture an overall distribution of the 
volume of traffic. Additional information on routing between points of ingress and egress for 
traffic flows can be incorporated to obtain significantly improved results, e.g., the model can 
incorporate information to model traffic exchanged with peer networks in a typical IP backbone 
network. Second, the traffic matrix is estimated by determining the matrix that minimizes a 
distance metric to an initial tomographic solution based on the gravity model, subject to 
tomographic constraints. Quadratic programming can be utilized to determine the solution in the 
space of those admitted by the tomographic constraints closest to the solution obtained by the 
gravity model. This step advantageously does not require (higher-order) statistics or additional 
traffic modeling assumptions. Applying network configuration and routing data to remove empty 
demands from the traffic matrix serves to dramatically decrease the problem dimension of 
computing the pseudo-inverse of the routing matrix. Then iterative proportional fitting can be 
used to ensure that non-physical results are replaced. The resulting traffic matrix comprises 
elements that accurately estimate the traffic volume from the plurality of ingress points to the 
plurality of egress points in the network. 
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[0006] The present invention is especially accurate for large elements and is robust, 
easily coping with data glitches and loss. The present invention is fast and flexible, easily 
extending to incorporate more detailed measurements where available. The present invention 
enables true network engineering, such as reliability analysis, traffic engineering, and capacity 
planning, in an IP framework. 

[0007] These and other advantages of the invention will be apparent to those of ordinary 
skill in the art by reference to the following detailed description and the accompanying drawings. 

BRIEF DESCRIPTION OF DRAWINGS 

[0008] FIG. 1 is an abstract diagram of an illustrative IP backbone network. 

[0009] FIG. 2 is an abstract diagram of three points of presence in the illustrative IP 
backbone network. 

[0010] FIG. 3 is a flowchart of processing performed to determine a traffic matrix, in 
accordance with an preferred embodiment of an aspect of the present invention. 

[001 1] FIG. 4 is a flowchart of processing performed to remove the empty demands in 
the initial tomographic solution, thereby reducing the complexity of the tomographic problem, in 
accordance with an embodiment of another aspect of the invention. 

[0012] FIG. 5 is an graph abstractly illustrating how the gravity model solution is 
utilized to obtain the final solution using a tomographic approach. 

[0013] FIG. 6 is illustrative MATLAB source code for computing a weighted least- 
squares estimate of the traffic matrix. 

DETAILED DESCRIPTION 

[0014] FIG. 1 is an abstract diagram of an illustrative Internet Protocol (IP) backbone 
network. The IP backbone network 100, as depicted in FIG. 1, can be represented as a set of 
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nodes and links, associated with the IP routers 101, 102, 103, 104, etc., and the IP adjacencies 
between those routers. The nodes and links wholly internal to the network 100 are referred to 
herein as backbone nodes and links; the others are referred to as edge nodes and links. A typical 
IP network managed by an Internet Service Provider (ISP) will connect via edge links to other 
autonomous systems (e.g., to a public Internet exchange point or directly to a private peer or 
transit provider) and customers (e.g., to a modem bank for dial-up users, a web-hosting complex, 
or a particular business or university campus). It is helpful to further categorize the edge links 
into access links 150, which connect to customers 160, and peering links 140, which connect to 
other non-customer autonomous systems, e.g., peers A 1 10 and B 120, as depicted in FIG. 1. It is 
also helpful, without loss of generality, to characterize those routers that terminate access and/or 
peering links as an edge router (ER) and those that terminate only backbone links as a backbone 
router (BR). FIG. 2 further illustrates this terminology. Three points-of-presence (PoPs) 201, 
202, 203 in the IP backbone network are depicted, each with a number of BRs 221, 222, 223, 224, 
225, 226 and ERs 231, 232, 233, 234, 235, 236. The links between BRs are referred to as core 
links, while the links between a BR and an ER are referred to as non-core links. 

[0015] It is assumed that there is access to routing information in the IP backbone 
network 100. For example, in large IP networks, distributed routing protocols are used to build 
the forwarding tables within each router. It is possible to predict the results of these distributed 
computations from data gathered from router configuration files. See co-pending commonly- 
assigned United States Utility Patent Application Serial No. 09/876,384, entitled "TRAFFIC 
ENGINEERING SYSTEM AND METHOD", which is incorporated by reference herein. There 
may be more than one route between two routers even using only shortest paths. It is assumed 
herein that traffic will be evenly distributed across all such routes, although the present 
methodology can be easily adapted by one of ordinary skill in the art to handle unequal 
distributions. 
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[0016] It is also assumed that some form of link load measurements are available in the 
IP backbone network 100. Although the present techniques can work with different types of 
network load data, including flow data, the present invention advantageously can take advantage 
of data widely available via the Simple Network Management Protocol (SNMP). See, e.g., D. 
Harrington, et al., "An Architecture for Describing SNMP Management Frameworks," Internet 
Engineering Task Force, Request for Comments 2571, April 1999. Since every router maintains 
a cyclic counter of the number of bytes transmitted and received on each of its interfaces, it is 
possible to obtain basic traffic statistics for the entire network with little additional infrastructure 
support. SNMP data notably has many limitations, which, preferably, should be taken into 
account in implementing a useful methodology. Data may be lost in transit, either due to SNMP 
utilizing unreliable UDP transport or to data loss when copying to a central archive. Data may be 
incorrect, e.g., through poor router vendor implementations. The data collection sampling 
interval is typically course, e.g. 5 minutes. Many of the typical problems in SNMP data may be 
removed with minimal artifacts using simple techniques. For instance, using hourly traffic 
averages, with five minute or finer data polls, mitigates the effect of missing data substantially. 
Slightly more sophisticated methods of anomaly detection and interpolation can produce even 
better results. It is advantageous to retain some of the unaveraged data for brief periods for 
troubleshooting and alarming. 

[0017] The traffic matrix can be computed with different levels of aggregation at the 
source and destination endpoints, for instance, at the level of PoP to PoP, or router to router, or 
link to link. Intuitively, the aggregation level needs to be sufficiently high so that the traffic 
exchanged between different locations is not sensitive to the detailed composition of the traffic. 
On the other hand, when the aggregation level is too high (e.g., PoP to PoP), ISP routing policies 
operating at a more granular level may have a profound impact and can introduce serious 
systematic distortion to the overall traffic pattern. The inventors have found it advantageous to 
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utilize router to router matrices, which are appropriate for a number of network and traffic 
engineering applications — and which, where useful, can be used to construct more highly 
aggregated traffic matrices (e.g., PoP to PoP) using routing information. Given two ERs £ { and 

Ej, the traffic between these edge routers is defined as the total amount of traffic that enters 
the network at E, and exits at with T = {T tj } the associated matrix. Similarly, given two 
BRs, the traffic between these backbone routers T? is defined such that the elements of the 

associated matrix, T B = {T? } , refer to traffic that enters and leaves the core. The traffic matrix t 

will often be referred to below in a vector form, in which the indices of the vector refer to 
source/destination pairs. 

[0018] FIG. 3 is a flowchart of processing performed to determine the network traffic 
matrix, in accordance with a preferred embodiment of the present invention. The processing can 
be readily performed by software or firmware on a fast computer processor, such as a Sun 
Microsystems 336 MHz UltraSPARC-II processor, with access to the above-mentioned link load 
measurements and routing information. • •«■••• 

[0019] At step 301, the link load measurements are collected. Where, for example, 
SNMP data is being utilized, an SNMP "poller" can be readily implemented in the network to 
periodically request an appropriate Management Information Base (MIB) from the relevant 
devices in the network. The data can then be stored in an archive and averaged, aggregated, or 
interpolated, where appropriate, before further processing. 

[0020] At step 302, the overall distribution of traffic in the network is captured using a 
simple model of the link-to-link traffic. For example, one of simplest approaches to modeling the 
link-to-link traffic is to apply what is referred to in the art as a "gravity model". A general 
formulation of a gravity model is given by the following equation: 
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X, , = — J - 

fu 



where Xy is the matrix element representing the force from i to j; Ri represents the repulsive 
factors that are associated with "leaving" from i; Aj represents the attractive factors that are 
associated with "going" to j; and f itj is a friction factor from / to j. Gravity models, which take 
their name from Newton's law of gravitation, are commonly used by social scientists to model the 
movement of people, goods, or information between geographic areas. In the present context, X ifj 
can be interpreted as the traffic volume that enters the network at location i and exits at location j\ 
the repulsion factor Ri as the traffic volume entering the network at location /, and the attractivity 
factor A y as the traffic volume exiting at location / The friction matrix,/,, can encode the locality 
information specific to different source-destination pairs. It is necessary to approximate the 
actual friction matrix using models with fewer parameters. In the embodiment described herein, a 
common constant is assumed for the friction factors, arguably the simplest among all possible 
approximation schemes. The resulting network gravity model simply states that the traffic 
exchanged between locations is proportional to the volumes entering and exiting at those 
locations. 

[0021] A simple gravity model can be utilized to estimate the amount of traffic between 
edge links as follows: Denote the edge links by U, l 2 , .... The volume of traffic T(l h lj) that enters 

the network at edge link /, and exits at edge link / ; is estimated. Let T™ nk (/■ ) denote the total 

traffic that enters the network via edge link l h and (/, ) the corresponding quantity for traffic 
that exits the network via edge link / f . The gravity model can then be computed by either of 



7 



Attorney Docket No. 2002-0174 



T ou (I ) 

l link 



Tli n k (h ) 



l link \ L j) 



Tlink (h ) 



The first equation states that the traffic matrix elements 7(/„ lj) are the product of the traffic 
entering the network via edge link /, and the proportion of the total traffic leaving the network via 
edge link while the second is reversed and is identical under traffic conservation - that is, the 
assumption that the interior network is neither a source, nor sink of traffic. While this assumption 
is violated (e.g., protocols running in the network interior act sources arid sinks, and packet drops 
act as sinks) the volumes involved are insignificant in the network considered. Most notably the 
actual results from the two equations are almost identical. 

[0022] It is possible and preferable to generalize the above simple gravity model to take 
into account a wide range of additional information provided by link classification and routing 
policy. For example, typically, in large North-American ISPs, the majority of traffic is 
exchanged between network customers and network peers. The pattern and handling of customer 
and peer traffic are quantitatively different, and this has a large impact on the traffic matrix. 
Furthermore, this peering traffic has a large impact on every aspect of network design and 
engineering, and so estimating the associated traffic matrices is very important. It is 
advantageous to adapt the gravity model to specifically differentiate between customer and 
peering traffic. 

[0023] A generalized gravity model can be constructed as follows. It is assumed that the 
network has a set of peers labeled P u P 2 , and exchanges traffic with peer P, over a set of edge 
links dedicated to this peer. This is commonly termed private peering and is the dominant mode 
of peering for large IP backbones today. A set of customer access links are labeled a h a 2 > and 
a set of peer links are labeled p h p 2 ,.... The set of edge links carrying traffic to peer is denoted 



8 



Attorney Docket No. 2002-0174 



by ^ u and the set of all peer links by ^\ The set of all customer access links is denoted by A 



SNMP measurements provide volumes of traffic on all edge links, j: T lir ^ k (j) , where the 

superscripts in (out) denotes traffic into (out of) the backbone. The traffic entering, or exiting the 
network to peer P h is 



where x = in or out. Then, the outbound traffic from access link a t e ^ to peering link p m e 



is 



^outbound {&i ) Pm ) — ^ 



0, otherwise. 



where X is the exit peering link, as explained below. The inbound traffic from peering link p, to 
access link aj is 



Unbound (Pi, a j) = Tttokipi) ^ fl i 



The internal traffic from access link a t to access link a s is 
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internal (&i } Ctj) — tHTTI \ Eternal 



where 



Eternal ( a i) = ^nkOfy') " Abound (Pfc » ^i) 



These equations are developed from the following assumptions, which reflect dominant Internet 
and ISP routing policies: 

[0024] Transit peer (peering link to peering link) traffic. It is assumed that the volume 
of traffic that transits across the backbone from one peer network to another is negligible. Transit 
traffic between peers may reflect a temporary step in network consolidation following an ISP 
merger or acquisition, but should not occur under normal operating circumstances. 

[0025] Outbound (access link to peering link) traffic. The proportionality assumption 
underlying gravity modeling is applied on a peer-by-peer basis: that is, the traffic exiting a 
specific peer comes from each access link in proportion to the traffic on that access link. It is 
assumed that all of the traffic from a single access link to the given peer exits the network on the 
same peering link (as determined by the Interior Gateway Protocol (IGP) and Border Gateway 
Protocol (BGP) routing configuration). The exit peering link for traffic from access link a x to 
peer P, is denoted by X(a h Pj) t which may be derived from routing configuration information. 
The assumption is typically true in practice, except for example when short-term load balancing 
is performed. In such situations, the above methodology could be supplemented with available 
statistics on the affected prefixes, though the inventors' experience has been that the impact is 
small and does not affect the accuracy of the traffic matrix computation. 

[0026] Inbound (peering link to access link) traffic. A network operator has little control 
over the injection of traffic into its network from peer networks. Accordingly, it is assumed that 
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the traffic entering from a given peering link is split amongst the access links in proportion to 
their outbound traffic. 

[0027] Internal (access link to access link) traffic. It is assumed that the fraction of 
internal traffic from a given access link a { to a second access link aj is proportional to the total 
traffic entering the network at a i9 and that the traffic between the links can be computed by 
normalization. 

[0028] The generalized gravity model has been found by the inventors to match actual 
Internet data very well. One possible explanation for this is that geographic locality is not a 
major factor in today's Internet, as compared to ISP routing policies. As long as the gravity 
model captures the essence of the routing policies, it becomes very accurate and the choice of the 
above-mentioned friction factor is less critical. It is certainly possible to further improve the 
above method by using a more accurate model with additional parameters. However, the margin 
for improvement may be limited. 

[0029] With reference again to FIG. 3, the result of the link-to-link gravity model at step 
302 is a traffic matrix that is not guaranteed to be consistent with the internal link measurements 
in the network — a significant drawback. This is remedied, in accordance with an aspect of the 
invention, by utilizing a tomographic approach. It is not expected nor required that the above 
gravity model accurately model the traffic between all source-destination pairs. In fact, one 
would naturally expect certain pairs of locations to stand out from the overall distribution, simply 
due to their specific characteristics (e.g., going through transoceanic links). Rather, the model 
need only capture the overall distribution. It is expected that tomographic estimation can be 
utilized to correct most of the violations in the assumptions underlying the model and thus 
significantly improve the accuracy. 

[0030] Tomographic methods are based on the system of linear equations x = A t 
where t is the traffic matrix, x represents the link loads, and A the network routing matrix. The 



11 



Attorney Docket No. 2002-0174 

traffic matrix t is written as a column vector t = {t u t 2 , An) T > where the M traffic matrix 
elements, t r , are the traffic between the rth source/destination pair. The link traffic is the sum of 
the traffic matrix elements that are routed across that link. The traffic (as measured in packets or 
bytes) that traverses the L links of the network during some period is represented as the set of 
observables x = (x\ 9 x 2 , x L ) T . The LxM routing matrix A = { A ir } where 



where F ir is the fraction of traffic from source/destination pair r that traverses link I In essence, 
the equation x = A t states that the traffic matrix must be consistent with network routing and 
measured link loads throughout the network, not just at the edge. This matrix equality is, 
however, highly under-constrained, and so allows many solutions. Tomographic methods differ 
in how a single "best" solution is identified from the possibilities. The majority of existing 
statistical tomographic approaches (commonly referred to as "network tomography" methods) use 
models of the higher order statistics of the link load data to create additional constraints. The 
present invention advantageously does not incorporate additional constraints, but rather uses the 
gravity model to obtain an initial estimate of the solution, which is further refined to satisfy the 
constraints. 

[0031] With reference again to FIG. 3, at step 303, the results of step 302 are 
transformed into an initial traffic matrix solution, referred to herein as t g . It is advantageous and 
preferable to transform the link-to-link model results into a more tractable problem, such as a BR- 
to-BR traffic matrix using routing information. BR-to-BR traffic matrices are generally more 
useful for traffic engineering tasks such as load balancing and are absolutely necessary for 
link/router failure analysis. In addition, more highly aggregated traffic matrices, such as POP-to- 




Fir, if traffic for r traverses link i~- 
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POP traffic matrices, may be directly derived from BR-to-BR matrices using routing information. 
Even considering only a hundred backbone routers, this leads to a problem with over a thousand 
unknowns, which is orders of magnitude more than the available constraints on link traffic 
volume. 

[0032] Accordingly, at step 304, the complexity of the tomographic problem for the 
border router traffic matrix is advantageously reduced, in one embodiment, by removing empty 
demands from the initial traffic matrix solution. First, since there may be multiple BRs within 
each PoP, traffic will flow only between the closest of these as determined by IGP routing — 
thereby, rendering many of the BR-to-BR matrix elements empty. As depicted in the simplified 
illustrative topology of FIG. 2, there are two BRs in each PoP, connecting ERs within the PoP 
with redundant links. Given shortest path routing (and equal link weights on backbone links), it 
can be seen that all of the traffic from PoP B 202 to PoP C 203 will traverse the route through 
BRs 222 and 223, while there will be no traffic entering the backbone nodes at BR 221 and 
departing at BR 224. While this is a very simple example, in operational IP networks, the set of 
paths consistent with IP routing will typically be significantly less than the set of .all. paths ... 
between router pairs. 

[0033] FIG. 4 is a flowchart of processing performed to remove the empty demands in 
the initial traffic matrix solution, in accordance with an embodiment of another aspect of the 

invention. The traffic matrix from BR B t to B } is denoted T* . At step 401, all elements of the 

BR to BR traffic matrix are initially marked as empty. At step 402, the routing protocol, e.g., the 
Interior Gateway Protocol (IGP), is simulated to find the shortest paths between each source and 
destination router. At step 403, for each path, let B t and Bj be its first and last BRs respectively, 

and mark T ( f as not empty. Then, at step 404, remove all that remain empty. This step is 

equivalent to removing elements from t that will be zero because the corresponding route is not 
used (unless failures occur). 
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[0034] Step 402 in FIG. 4 would typically be done for each pair of edge routers, which 
can be prohibitive due to the large number of routers. However, the "topological equivalence" of 
edge routers can be exploited to avoid having to run simulations for all possible pairs of routers. 
, Two edge routers are said to be "topologically equivalent" if they connect to the same (non- 
empty) set of BRs and the protocol weights on the corresponding links are the same. (It should 
be noted that there may be multiple layer hierarchy of ERs within a PoP, and lower layer ERs 
pass traffic to BRs only through the higher layer ERs. The lower layer of ERs may be removed 
from consideration as their (network compacting) traffic can be seen at the higher layer.) Such 
equivalent edge routers can be grouped together and considered as what the inventors refer to as a 
single edge router equivalence class (EREC). The routes between the component ERs of the 
same pair of ERECs advantageously should be the same except for the first and last links. 
Consequently, only one IGP simulation need be run for each pair of ERECs. Computing routes 
on the above basis can reduce the computational burden by a factor of 20. After eliminating all of 
the empty demands, the number of unknowns can be reduced by a factor of 10, thereby turning a 
highly under-constrained linear inverse problem (with potentially hundreds of constraints and 
tens of thousands of unknowns) into a moderately under-constrained problem (with hundreds of 
constraints and about a thousand unknowns), and making the computation orders of magnitude 
faster. 

[0035] Finally, at step 305 in FIG. 3, an optimization-based tomographic approach is 
utilized to refine the initial gravity model solution. As depicted abstractly in FIG. 5, the final 
estimate of the traffic matrix is selected by choosing the solution in the space of those admitted 
by the tomographic model that is "closest" to the solution obtained by the gravity model, in 
accordance with some form of objective function or distance metric. For example, the gravity 
model may be refined using a least-squares solution that minimizes the Euclidean distance to the 
gravity model solution subject to the tomographic constraints, as depicted in FIG. 5. More 
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specifically, the following quadratic program could be solved 



min || t - t g j| 
s»t. || At — a: || is minimized 



where | . || is the La norm of the vector, i.e. the Euclidean distance to the origin). As another 

example, a weighted least-squares solution could be utilized, in which the projection onto the 
subspace is not orthogonal, but rather weighted by a function of the size of the estimated traffic 

matrix elements (t g ). That is, the equation ||(t - t g ) /w| could be used as the objective function 

to minimize in the above quadratic program, where w is the weight vector, and / is the element- 
by-element vector division. As illustrated in FIG. 5, the simple least-square solution is just an 
orthogonal projection of the gravity model solution onto the constraint sub-space. The weighted 
least squares solution, on the other hand, gives different weight to different unknowns in the 
solution. 

[0036] Note that the tomographic constraints may be ill-posed due to possible 
dependency among different link constraints. Furthermore, the constraints may not be satisfiable 
due to error and noise in the link load data or possible routing changes that are not captured by 
the topology data. The standard technique for dealing with ill-posed quadratic programs is to use 
Singular Value Decomposition (SVD) of the routing matrix A to compute its pseudo-inverse. 
The resulting solution is the closest to the initial solution among all solutions that minimize the 

discrepancy against the tomographic constraints ( At -X ). Routines to compute the pseudo- 
inverse are available in many numerical computing packages, e.g. MATLAB. FIG. 6 lists 
MATLAB source code that implements such an approach. 
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[0037] It should be noted that, where an approach such as least-squares is utilized, the 
methodology may result in negative values, which are clearly without physical meaning. 
Accordingly, at step 306 in FIG. 3, these non-physical values should be corrected. One approach 
to avoiding these values is to view the problem as a constrained optimization problem. However, 
a simple iterative procedure provides a fast and effective alternative. Specifically, Iterative 
Proportional Fitting (IPF) can be utilized to ensure non-negativity. See J. Cao, D. Davis, S. V. 
Wiel, and B. Yu, "TimeOvarying network tomography: router link data," J. Amer. Statist. Assoc., 
Vol. 95, No. 452, pp. 1063-1075 (2000), which is incorporated by reference herein. For the 
initial estimate, the traffic matrix estimated above can simply be used, with zero replacing the 
negative elements of the matrix. IPF then proceeds by successively refining the estimate using 
the above-disclosed method. The computation is simpler and the initial condition is not complex, 
since higher order statistics of the process are not modeled. 

[0038] The above embodiment of the present invention is remarkably fast, taking as 
little as 5 seconds on a 336 MHz UltraSPARC-II processor to compute a backbone router to 
backbone router traffic matrix on a tier-1 IP network. . . .. . 

[0039] The complexity of the generalized gravity model is 0(N ) in the number of edge 
links being considered, but the number of operations per term is small. The worst case 
complexity of the above quadratic program is linear in the number of unknowns (elements in the 
traffic matrix), and quadratic in the number of constraints. In practice, however, the complexity 
of singular value decomposition methods is generally less than this. For instance, the SVD used 
in MATLAB are usually better than this complexity estimate would indicate. In reality, this 
computation can run significantly under 2 seconds. Computation of the generalized gravity 
model for a complete network on the order of 1000 routers can take less than 3 seconds. 
Computing routes by taking advantage of edge router equivalence classes can reduce the 
computational burden by a factor of 20. The time taken in computing the routing matrix 
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dominates all other aspects of the above embodiment, taking two to three minutes. It should be 
noted, however, that this cost can often by amortized over multiple traffic matrix computations 
because the routing matrix need be recomputed only when the network topology changes. The 
method used to reduce the problem size can be performed as part of computing the routing matrix 
with a computational cost that is a very small marginal cost on top of computing the routing 
matrix itself. 

[0040] The inventors have tried the above embodiment utilizing straight least-squares 
and a range of different weighting schemes, including constant weighting, linearly proportional to 
the terms in the gravity model traffic matrix, and/or proportional to the square root of the gravity 
model. The inventors have generally found that (a) the simple gravity model is better than the 
raw least-squares approach, (b) the generalized gravity model is better than the simple gravity 
model, and (c) the best results come from using the generalized gravity model with the weighted 
least-squares method using square root weights (though the improvement over using other 
weightings is small). Moreover, the inventors have found that, even where there are errors in the 
traffic matrix, the results can still be used for a range of operational tasks. such as detecting traffic. . 
changes. The overall approach is robust to measurement errors on the observables. 

[0041] The foregoing Detailed Description is to be understood as being in every respect 
illustrative and exemplary, but not restrictive, and the scope of the invention disclosed herein is 
not to be determined from the Detailed Description, but rather from the claims as interpreted 
according to the full breadth permitted by the patent laws. It is to be understood that the 
embodiments shown and described herein are only illustrative of the principles of the present 
invention and that various modifications may be implemented by those skilled in the art without 
departing from the scope and spirit of the invention. For example, the detailed description 
describes an embodiment of the invention with particular reference to IP backbone networks. 
However, the principles of the present invention could be readily extended to other network 
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architectures. Such an extension could be readily implemented by one of ordinary skill in the art 
given the above disclosure. 
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